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Spontaneous electroweak symmetry breaking (EWSB) 
has been introduced in the standard model (SM) to ex- 
plain how fundamental particles acquire mass. In the 
SM, EWSB requires the existence of a yet unobserved 
scalarparticle, the Higgs boson [lj-Q- Direct searches at 
LEP [J] , combined with the recent exclusion mass ranges 
provided by the Tevatron Q and LHC experiments j(J, 
have reduced the possible values for the mass of the SM 
Higgs boson. The Higgs mass is now excluded at 95% 
confidence level (C.L.) below 114 GeV/c 2 and between 
141 and 476 GeV/c 2 . A global fit to electroweak measure- 
ments Q provides indirect indications that confirm the 
region below 158 GeV/c 2 as the most probable. There- 
fore the searches in the low mass decay channels play 
a crucial role to set a final statement on the SM Higgs 
boson existence. 

Although the Higgs boson decay into a pair of r lep- 
tons is not the dominant process in the low mass regime, 
searches in this final state are important for several rea- 
sons. First, the inclusion of additional channels improves 
overall sensitivity of the search. Also, once the Higgs bo- 
son is discovered, measurements of its decay branching 
ratios (B.R.'s) into different channels can give important 
information about its true nature. Furthermore, when 
beyond SM scenarios are considered, such the minimal 
supersymmetric extension of the SM (MSSM) Q, the 
study of final states containing r leptons attracts even 
more interest, given their enhanced coupling to the Higgs 
bosons in large regions of the model parameter space. 

A previous search for the SM Higgs boson in the final 
states with r leptons was performed by the DO Collab- 
oration Q. This Letter presents results of a search for 
the H -* tt decays in the final state with two r leptons 
and one or more jets, using a data sample corresponding 
to an integrated luminosity of 6.0 ftr 1 of pp collisions 
collected at the Tevatron by the CDFII detector at y/s 
= 1.96 TeV. The search is sensitive to contributions of 
several Higgs boson production mechanisms: gluon-gluon 
fusion gg -* H , associated production with the W or Z 
boson (VH) , and the vector boson fusion (VBF) qHq' -> 
qrrq' . To improve the search sensitivity a novel strategy 
for the t identification is proposed, based on a multivari- 
ate technique [Io|]- Tau leptons are short-lived particles 
(t = 290.6 x 10" 15 s [HI), which can be detected only 
through their decay products. Semi-hadronic decays r -* 
XhV T , where Xh is a system of hadrons and v T is the t 
neutrino, are denoted in this paper as Th and correspond 
to a B.R. of about 65%. With leptonic decays 77 -* lviv T 
accounting for the remaining 35%, the final states t^t;, 
studied in this paper include about 46% of all possible 
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decays of r p airs. 

CDFII [lj-ll4| is an azimuthally and forward-backward 
symmetric general-purpose particle detector. The geom- 
etry is described using the azimuthal angle tp and the 
pseudo-rapidity r) = -lntan(0/2), where 9 is the polar 
angle measured with respect to the proton beam axis z. 
A charged-particle tracking system, immersed in a 1.4 T 
solenoidal magnetic field, consists of an inner set of silicon 
micro-strip detectors i\r\\ < 2), and a open-cell drift cham- 
ber (I77I < 1). Electromagnetic and hadronic sampling 
calorimeters surround the tracking system and cover cen- 
tral (\r)\ < 1.1) and forward (1.1 < \rj\ < 3.6) regions. Pro- 
portional chambers and scintillating strip detectors, em- 
bedded in the electromagnetic section of the central and 
forward calorimeters correspondingly, measure positions 
and transverse profiles of the showers. Dedicated muon 
detectors located outside the calorimeter provide cover- 
age for I77I < 1.5. Gas Chcrenkov counters, installed in 
the forward region 3.7 < \r]\ < 4.7, measure the average 
number of inelastic interactions per bunch crossing, pro- 
viding a measurement of the instantaneous luminosity. 
The components of energy and momenta in the plane 
transverse to the beam are defined as Et = E$'m9 and 
Pt = psin9. Since neutrinos do not interact with the de- 
tector components, their Et must be inferred from the 
energy imbalance in the transverse plane, T (l5| . 

Data for this search have been collected using a trigger 
selection which requires one central electron or muon 
candidate with p^ > 8 GeV/c and one additional isolated 
track with pt > 5 GeV/c [16| . which is used as a starting 
point for the Th reconstruction. 

Jets are reconstructed from clustered energy deposi- 
tions in the calorimeter towers using a fixed cone al- 
gorithm [ijj with a radius in the tp-r] space AR = 

Aip 2 + A77 2 < 0.4. Jets are required to have \rj\ < 2.5, an 
electromagnetic fraction E em /E < 0.9, and Et > 20 GeV, 
where the transverse energy is corrected for instrumental 
effects and multiple pp collisions in the event [17 1 . 

At the analysis level, an electron candidate is recon- 
structed as a calorimeter cluster with Et > 10 GeV 
pointed to by a well-measured track. The candidate is 
required to have a shower mostly contained in the elec- 
tromagnetic compartment, and the lateral shape of the 
shower must be consistent with test beam measurements. 
A muon candidate is reconstructed from a track with pt > 
10 GeV/c, matching to the track segments in the muon 
chambers. Muon energy deposition in the calorimeter 
should be consistent with that of a minimum-ionizing 
particle. Electron and muon reconstruction is described 
in detail in Ref. [3] . 

In the detector, semi-hadronic decays of the r leptons 
manifest themselves as narrow jets with low track multi- 
plicity. A Th candidate is required to have a good quality 
track with pr > 6 GeV/c, pointing to a calorimeter clus- 
ter with E T > 9 GeV and \r/\ < 1.0. Momenta of the 
charged and neutral particles reconstructed around the 
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highcstpT track in the cluster (referred to as the seed 
track [18|). define the "visible" four- momentum of the 
Th candidate, p V i S . The transverse component of p V i S , 
or visible px, is required to be greater than 15 GeV/c. 
Momenta of the neutral pions and photons are recon- 
structed using the central shower maximum detector. To 
be consistent with the topology of the hadronic t decay, 
the number of tracks associated with the Th candidate 
is required to be 1 or 3. Tau decay modes with higher 
charged track multiplicity, contributing of the order of 
1%, are not considered. Contamination from electrons 
and muons mimicking the Th signature is reduced by re- 
jecting candidates with Eh a d/P < 0.2, where Ehad is the 
hadronic part of the Th cluster energy, and P is the scalar 
sum of all track momenta associated with the Th candi- 
date. To discriminate between taus and jets originat- 
ing from quarks or gluons (referred to as QCD jets), the 
reconstructed Th candidates are required to be isolated. 
This is achieved by demanding that no charged or neutral 
particles are reconstructed in the isolation annulus [3| 
defined around the seed track, or by requiring the sum of 
their measured px to be below a given threshold. In this 
search, the discrimination between 77, 's and QCD jets is 
further enhanced by using a multivariate selection based 
on a set of Boosted Decision Trees (BDTs) [H[20|. Sev- 
eral Th categories are defined according to the number of 
tracks and to the visible px of the Th candidate, and spe- 
cific BDTs are implemented for each category. The input 
variables which provide the largest discrimination power 
are the Th visible mass, M V j S = -J p^ is , and the number 
of charged and neutral particles reconstructed in the iso- 
lation annulus, as well as the sum of their px- Compared 
to the cut-based Th selection 2l|, the BDT-based selec- 
tion reduces the jet -* Th misidentification rate by 25% 
while maintaining the same identification efficiency, at a 
level of 35% for p T > 30 GeV/c. 

The two identified r leptons in the event must have 
opposite charges and be spatially separated, Ai? > 0.4. 
In addition, selected events are required to have a good 
quality vertex with \V Z \ < 60 cm, where V z is the vertex 
z-coordinate. 

The irreducible background contributions to this 
search originate from Z+jets, ti and diboson events with 
one Th and an electron (or muon) in the final state. 
Another source of background events is due to lepton 
misidentification in 7+jet, QCD multijet and VF+jets 
production. Z -*■ ee/fifi events can satisfy the selec- 
tion criteria when one of the two electrons or muons is 
mistakenly identified as a Th candidate. This contribu- 
tion is suppressed by vetoing events containing a Th with 
Ehad/P < 0.4 and the dilepton invariant mass M; T con- 
sistent with that of the Z boson. 

The signal and background estimates are based on 
a combination of Monte Carlo (MC) simulations and 
data-driven methods. The detector response is sim- 
ulated with a GEANT3-based package [22j. Higgs, ti 
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FIG. 1: Invariant mass of the two most energetic jets, in the 
iV(jets) > 2 channel. 



and diboson production processes are generated by 
PYTHIA Q with the CTEQ5L set of parton distribu- 
tion functions (PDF's) [24| and their contributions nor- 
malized to the next-to-leading order (NLO) theoretical 
cross sections [25l - [29j . The Z+jets background is gen- 
erated using ALPGEN (30[, matched with PYTHIA for the 
hadronization and parton showering. Relative contribu- 
tions of the different Z+n partons subprocesses are pro- 
vided by ALPGEN, with the total event yield normalized 
to the measured Z 



11 production cross section [31 
Additional correction factors, determined by comparing 
data and simulations in specific control samples, are ap- 
plied to account for the observed mis- modeling of the e//i 
->• Th misidentification probabilities. Background contri- 
butions resulting from lepton misidentification are esti- 
mated using a data-based method. In 7+jet and QCD 
multijet events no significant correlation is expected be- 
tween the charges of the two reconstructed lepton candi- 
dates, thus leading to an equal amount of opposite-sign 
(OS) and same-sign (SS) events with similar kinematic 
features. The 7+jet and QCD multijet contributions in 
the OS event selection are then modeled using the SS 
data sample. In case of the IF+jets process there is a 
correlation between the charge of the lepton produced in 
the W boson decay and the charge of the outgoing quark 
which generates the jet misidentified as the Th', this re- 
sults in an excess of OS events. This OS/SS asymmetry, 
A = 2.2 ± 0.5, is measured in a W+jets enriched con- 
trol sample obtained by loosening the Th selection and 
by applying a cut on If T > 20/30/35 GeV (for events 
with A (jets) = 0, 1 or > 2, respectively) and on the re- 
constructed W transverse mass, Mx > 40 GeV/c [32j |. 

We split the analysis sample into three subsamples ac- 
cording to the number of reconstructed jets in the final 
state. Events with A (jets) = are used to validate the 
background model. The Th identification efficiency, e, is 
measured in a high purity Z -> tt sample, defined by 
events with A(jcts) = 0, $ T > 10 GeV and M T < 60 
GeV/c 2 . The ratio of the e measurements in the data 
and MC, SF r = 0.96 ± 0.05, is used to correct the ac- 
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TABLE I: Expected event yields in the two signal channels, 
compared to the number of observed events. Quoted errors 
include both systematic and statistical uncertainties. 



Process 



A (jets) = 1 A(jets) > 2 



Z — > TT 


935 ± 85 


160 ± 24 




30.6 ± 3.9 


6.8 ± 1.0 


Z -* (ifi 


15.5 ± 2.7 


1.9 ± 0.5 


WW/WZ/ZZ 


9.3 ± 1.0 


2.2 ± 0.3 


ti 


11.3 ± 1.6 


39.3 ± 4.6 


jet-> r fakes 


1219 ± 124 


171 ± 19 


VK+jets 


151 ± 38 


46.8 ± 14.1 


Total background 


2371 ± 156 


428 ± 36 


99 -ff 


1.42 ± 0.45 


0.34 ± 0.24 


WH 


0.23 ± 0.03 


0.39 ± 0.05 


ZH 


0.13 ± 0.02 


0.25 ± 0.03 


VBF 


0.18 ± 0.02 


0.25 ± 0.03 


Total signal (M H = 120 GeV/c^) 


1.96 ± 0.45 


1.23 ± 0.29 


Data 


2517 


462 



ccptancc. Expected signal and background yields and the 
observed data in the iVQets) = 1 and TV (jets) > 2 channels 
are summarized in Table |U Figure [1] shows the invariant 
mass of the two most energetic jets in the TV (jets) > 2 
channel; since the hadronic decay of the W or Z boson is 
reconstructed, this variable is one of the most powerful 
in discriminating the associated Higgs boson production 
signal from the largest backgrounds. 

Several sources of systematic uncertainties affect both 
the signal and background estimates. The largest one is 
the uncertainty on the jet energy scale [T3|, which leads 
to acceptance variations up to a 20% level in different 
jet multiplicity channels. Non-negligible effects on the 
shapes of different jet-related kinematic distributions are 
also observed and taken into account. Another relevant 
source of systematic uncertainty which affects the sample 
normalizations is the theoretical and experimental uncer- 
tainty on the cross sections: uncertainties on diboson and 
ti production are 6% and 10% respectively 2oT- 27l while 
for Drell-Yan processes we take a value of 2.2% [31|. The 
gg -* H production depends strongly on the gluon parton 
density function and the accompanying value of a s (q 2 ). 
The theoretical uncertainty on the cross section of this 
process, calculated separately for different jet multiplic- 
ity channels, results in uncertainties of 23.5% and 67.5% 
on the expected yield of the signal events with iV(jets) 
= 1 and TV (jets) > 2, respectively [11]. The cross sec- 
tion uncertainties for the other Higgs boson production 
mechanisms are 10% for VBF and 5% for VH [H, Hif. 
The uncertainties on the identification efficiency and 
on the e -*■ Th and (i -*■ misidentification probabilities, 
are 5%, 7%, and 15% respectively. Systematic uncertain- 
ties due to the modeling of the initial state (ISR) and 
final state radiation (FSR) have been evaluated for all 
the Higgs boson production processes: the largest varia- 
tion, 15%, corresponds to the gluon fusion process. The 



choice of the PDFs affects the acceptance at the level of 
a few percent, while an uncertainty of 2.3% on the inclu- 
sive Drell-Yan acceptance is derived from the differences 
between the observed ALPGEN and PYTHiA-based calcu- 
lations. A 10% uncertainty on the contribution of events 
with jets misidentified as Th is estimated from SS data. 
The uncertainty of the VF+jets background is estimated, 
using OS/SS asymmetry, at 25% (30%) for W + 1 (> 2) 
jet channel. 

After all selection cuts are applied, the background ex- 
pectation is still significantly larger than the expected 
Higgs boson signal. The dominant irreducible back- 
ground contribution, Z-» tt, is topologically very sim- 
ilar to the signal, thus affecting the sensitivity of the 
search; in addition, the Higgs boson invariant mass re- 
constructed from the visible decay products of the two 
r leptons suffers from a very poor resolution, given that 
a sizeable fraction of the r energy is carried by the un- 
detected neutrinos. Since no other single powerful vari- 
able is available, a multivariate approach is followed and 
the discriminating power of various kinematic and topo- 
logical observables is combined into one single output 
discriminant. The search strategy is based on two se- 
quential BDT-based selections applied to the candidate 
events. The first set of BDTs is trained to discriminate 
H -» tt from W+jets, QCD, Z ee/fi/i and ti back- 
grounds. At this step an event is accepted if its BDT 
score is higher than an optimal value, estimated by max- 
imizing the search sensitivity. The second set of BDTs is 
trained to discriminate the signal against Z -* tt. The 
choice and the number of input variables is optimized for 
each BDT, considering the most discriminating among a 
set of 23 well-modeled kinematic quantities. The train- 
ing procedure is performed separately for the iV(jets) = 1 
and AT (jets) > 2 channels and optimized for two different 
Higgs boson mass scenarios (Mff = 120 GcV/c 2 and Mh 
= 140 GeV/c 2 ). Results are shown in Fig. [5] for a Higgs 
boson mass of 120 GeV/c 2 . 

The observed number of events in the two channels, 
as well as the distributions in the BDT output variables, 
are consistent with the SM background expectations, so 
we use the BDT output templates to set 95% C.L. upper 
limits on the SM Higg s boson production. A Bayesian 
likelihood method [ll| with a flat prior assigned to the 
signal cross section is employed, and all statistical and 
systematic shape and rate uncertainties are appropriately 
incorporated. The two channels are combined, and the 
resulting limits are shown in Fig. [3] and summarized in 
Table ITT1 For all mass values, the expected and observed 
limits are within one standard deviation from each other. 

We presented in this Letter the results of the search for 
the Higgs boson in the tt decay mode, using final states 
involving one hadronically decaying r lepton. This mea- 
surement contributes to the sensitivity of the low mass 
Higgs boson searches at the Tevatron. The value of the 
expected (observed) limit for a Higgs boson mass of 120 
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FIG. 2: Final discriminant distributions for the AT (jets) = 1 and AT (jets) > 2 jets channels, obtained by combining a set of 
BDTs trained to discriminate a 120 GeV/c 2 Higgs boson signal from the main sources of background. 



TABLE II: Expected and observed 95% C.L. upper limits on a(pp -> H)B.R.(H -> rr), relative to the theoretical SM prediction. 



M H (GeV/c 2 ) 


100 


105 


110 


115 


120 


125 


130 


135 


140 


145 


150 


Expected 


18.9 


17.5 


15.7 


15.2 


15.3 


16.9 


19.4 


23.6 


29.7 


40.9 


62.0 


Observed 


19.6 


18.1 


15.5 


14.7 


14.6 


16.4 


18.7 


22.6 


33.7 


47.3 


70.2 



GeV/c 2 is 15.3 (14.6) times the SM expectation. 
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